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SEQUENCE LISTING 

<110> De Maria, Leonardo 
Andersen, Carsten 

Christensen, Lars Lehmann Hylling 
Lassen, Soren Flensted 
Ostergaard, Peter Rahbek 

<120> Protease Variants 

<130> 10508. 204-US 

<160> 21 

<170> Patentin version 3.3 

<210> 1 

<211> 1596 

<212> DNA 

<213> Nocardiopsis sp. NRRL 18262 ("Protease 10") 
<220> 

<221> CDS 

<222> (318) . . (1463) 

<220> 

<221> sig_peptide 
<222> (318) . . (404) 

<220> 

<221> mat_peptide 
<222> (900) . . (1463) 

<400> 1 

acgtttggta cgggtaccgg tgtccgcatg tggccagaat gcccccttgc gacagggaac 60 

ggattcggtc ggtagcgcat cgactccgac aaccgcgagg tggccgttcg cgtcgccacg 120 

ttctgcgacc gtcatgcgac ccatcatcgg gtgaccccac cgagctctga atggtccacc 180 

gttctgacgg tctttccctc accaaaacgt gcacctatgg ttaggacgtt gtttaccgaa 240 

tgtctcggtg aacgacaggg gccggacggt attcggcccc gatcccccgt tgatcccccc 300 

aggagagtag ggacccc atg cga ccc tec ccc gtt gtc tec gee ate ggt 350 

Met Arg Pro Ser Pro Val Val Ser Ala lie Gly 

-190 -185 

acg gga gcg ctg gcc ttc ggt ctg gcg ctg tee ggt ace ccg ggt 395 
Thr Gly Ala Leu Ala Phe Gly Leu Ala Leu Ser Gly Thr Pro Gly 

-180 -175 -170 

gcc etc gcg gcc ace gga gcg etc ccc cag tea ccc acc ccg gag 440 
Ala Leu Ala Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu 

-165 -160 -155 



1 



gcc gac gcg gtc tec atg cag gag gcg etc cag cgc gac etc gac 
Ala Asp Ala Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp 

-150 -145 -140 



485 



etg ace tee gcc gag gcc gag gag ctg ctg gcc gcc cag gac acc 530 
Leu Thr Ser Ala Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr 

-135 -130 -125 

gcc tte gag gtc gac gag gcc gcg gee gag gcc gcc ggg gac gcc 575 
Ala Phe Glu Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala 

-120 -115 -110 

tac ggc gge tec gtc tte gac acc gag age ctg gaa ctg acc gtc ctg 623 
Tyr Gly Gly Ser Val Phe Asp Thr Glu Ser Leu Glu Leu Thr Val Leu 

-105 -100 -95 

gtc ace gat gee gcc gcg gtc gag gee gtg gag gcc acc ggc gcc ggg 671 
Val Thr Asp Ala Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly 
-90 -85 -80 

acc gag ctg gtc tec tac ggc ate gac ggt etc gac gag ate gtc cag 719 
Thr Glu Leu Val Ser Tyr Gly lie Asp Gly Leu Asp Glu lie Val Gin 
-75 -70 -65 

gag etc aac gee gcc gac gcc gtt ccc ggt gtg gtc ggc tgg tac ccg 7 67 

Glu Leu Asn Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro 
-60 -55 -50 -45 

gac gtg gcg ggt gac acc gtc gtc ctg gag gtc ctg gag ggt tec gga 815 
Asp Val Ala Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly 

-40 -35 -30 

gcc gac gtc age ggc ctg etc gcg gac gcc ggc gtg gac gcc teg gcc 863 
Ala Asp Val Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala 

-25 -20 -15 

gtc gag gtg acc acg age gac cag ccc gag etc tac gcc gac ate ate 911 
Val Glu Val Thr Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie 
-10 -5 -1 1 

ggt ggt ctg gcc tac acc atg ggc ggc cgc tgt teg gtc ggc tte gcg 959 
Gly Gly Leu Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala 
5 10 15 20 

gee ace aac gee gee ggt cag ccc ggg tte gtc acc gee ggt cac tge 1007 
Ala Thr Asn Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys 

25 30 35 

ggc cgc gtg ggc acc cag gtg acc ate ggc aac ggc agg ggc gtc tte 1055 
Gly Arg Val Gly Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe 

40 45 50 

gag cag tec gtc tte ccc ggc aac gac gcg gcc tte gtc cgc ggt acg 1103 
Glu Gin Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr 
55 60 65 

tec aac tte acg etg acc aac ctg gtc age cgc tac aac ace gge ggg 1151 



2 



Ser Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly 
70 75 80 

tac gcc acg gtc gcc ggt cac aac cag gcc ccc ate ggc tec tec gtc 1199 
Tyr Ala Thr Val Ala Gly His Asn Gin Ala Pro lie Gly Ser Ser Val 
85 90 95 100 

tge ege tec ggc tee aec ace ggt tgg cac tge ggc acc ate eag gee 1247 
Cys Arg Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala 

105 110 115 

cgc ggc cag teg gtg age tac ccc gag ggc acc gtc acc aac atg acc 1295 
Arg Gly Gin Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr 

120 125 130 

egg acc acc gtg tge gcc gag ccc ggc gac tec ggc ggc tec tac ate 1343 
Arg Thr Thr Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie 
135 140 145 

tec ggc acc cag gcc cag ggc gtg acc tec ggc ggc tec ggc aac tge 1391 
Ser Gly Thr Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys 
150 155 160 

cgc ace ggc ggg ace ace ttc tac eag gag gtc ace ccc atg gtg aac 1439 
Arg Thr Gly Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn 
165 170 175 180 

tec tgg ggc gtc cgt etc egg acc tgatcccege ggttccaggc ggaecgaegg 14 93 
Ser Trp Gly Val Arg Leu Arg Thr 

185 

tegtgacctg agtaccagge gtccccgccg ettccagegg cgteegcace ggggtgggae 1553 
cgggcgtggc eaeggceeea ecegtgaccg gaeegeeegg eta 1596 



<210> 2 
<211> 382 

<212> PRT 

<213> Noeardiopsis sp. NRRL 18262 ("Protease 10") 
<400> 2 

Met Arg Pro Ser Pro Val Val Ser Ala lie Gly Thr Gly Ala Leu 

-190 -185 -180 



Ala Phe Gly Leu Ala Leu Ser Gly Thr Pro Gly Ala Leu Ala Ala 

-175 -170 -165 



Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 

-160 -155 -150 



Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala 

-145 -140 -135 



3 



Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val 

-130 -125 -120 



Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser 

-115 -110 -105 



Val Phe Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala 

-100 -95 -90 



Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu Val 

-85 -80 -75 



Ser Tyr Gly lie Asp Gly Leu Asp Glu lie Val Gin Glu Leu Asn Ala 
-70 -65 -60 



Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly 
-55 -50 -45 



Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Ser 
-40 -35 -30 -25 



Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr 

-20 -15 -10 



Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala 

-5 -11 5 



Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn Ala 
10 15 20 



Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val Gly 
25 30 35 40 



Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin Ser Val 

45 50 55 



Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe Thr 

60 65 70 



Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr Val 
75 80 85 



4 



Ala Gly His Asn Gin Ala Pro lie Gly Ser Ser Val Cys Arg Ser Gly 
90 95 100 



Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin Ser 
105 110 115 120 



Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr Val 

125 130 135 



Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Thr Gin 

140 145 150 



Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly Gly 
155 160 165 



Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly Val 
170 175 180 



Arg Leu Arg Thr 
185 



<210> 3 

<211> 1065 

<212> DNA 

<213> Nocardiopsis dassonvillei subspecies dassonvillei DSM 43235 ("Protease 
18") 



<220> 

<221> CDS 

<222> (1)..(1062) 

<220> 

<221> niat_peptide 
<222> (499) . . (1062) 

<400> 3 

get ccg gcc ccc gtc ccc cag acc ccc gtc gcc gac gac age gee 45 
Ala Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Asp Ser Ala 
-165 -160 -155 

gcc age atg acc gag gcg etc aag cgc gac etc gac etc acc teg 90 
Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser 
-150 -145 -140 

gcc gag gee gag gag ctt etc teg gcg cag gaa gcc gcc ate gag 135 
Ala Glu Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala He Glu 
-135 -130 -125 



5 



acc gac gcc gag gcc acc gag gcc gcg ggc gag gcc tac ggc ggc 180 
Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 -110 

tea ctg ttc gac acc gag acc etc gaa etc acc gtg ctg gtc acc gac 228 
Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 

gcc tec gee gtc gag gcg gtc gag gcc ace gga gcc cag gcc ace gtc 27 6 

Ala Ser Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val 
-90 -85 -80 -75 

gtc tec cae ggc acc gag ggc ctg acc gag gtc gtg gag gac etc aac 324 
Val Ser His Gly Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn 

-70 -65 -60 

ggc gcc gag gtt cec gag age gtc etc ggc tgg tac ccg gac gtg gag 372 
Gly Ala Glu Val Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp Val Glu 

-55 -50 -45 

age gac acc gtc gtg gtc gag gtg ctg gag ggc tec gac gcc gac gtc 420 
Ser Asp Thr Val Val Val Glu Val Leu Glu Gly Ser Asp Ala Asp Val 
-40 -35 -30 

gcc gcc ctg etc gcc gac gcc ggt gtg gac tec tee teg gtc egg gtg 4 68 

Ala Ala Leu Leu Ala Asp Ala Gly Val Asp Ser Ser Ser Val Arg Val 
-25 -20 -15 

gag gag gcc gag gag gcc ccg cag gtc tac gee gac ate ate ggc ggc 516 
Glu Glu Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp He He Gly Gly 
-10 -5 -11 5 

ctg gcc tac tac atg ggc ggc cgc tgc tec gtc ggc ttc gcc gcg acc 564 
Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 

10 15 20 

aac age gee ggt cag cec ggt ttc gtc acc gcc ggc cae tgc ggc ace 612 
Asn Ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 
25 30 35 

gtc ggc acc ggc gtg acc ate ggc aac ggc acc ggc acc ttc cag aac 660 
Val Gly Thr Gly Val Thr He Gly Asn Gly Thr Gly Thr Phe Gin Asn 
40 45 50 

teg gtc ttc cec ggc aac gac gcc gee ttc gtc cgc ggc acc tec aac 708 
Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn 

55 60 65 70 

ttc acc ctg acc aac ctg gtc teg cgc tac aac tec ggc ggc tac cag 756 
Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Gin 

75 80 85 

teg gtg acc ggt acc age cag gcc ccg gcc ggc teg gcc gtg tgc cgc 804 
Ser Val Thr Gly Thr Ser Gin Ala Pro Ala Gly Ser Ala Val Cys Arg 

90 95 100 

tec ggc tec acc acc ggc tgg cac tgc ggc acc ate cag gcc cgc aac 852 
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Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Asn 
105 110 115 



cag acc gtg cgc tac ccg cag ggc acc gtc tac teg etc acc cgc acc 900 
Gin Thr Val Arg Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr 
120 125 130 

aac gtg tgc gcc gag ccc ggc gac tec ggc ggt teg tte ate tec ggc 948 
Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe lie Ser Gly 
135 140 145 150 

teg cag gcc cag ggc gtc acc tec ggc ggc tec ggc aac tgc tec gtc 996 
Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Val 

155 160 165 

ggc ggc aeg acc tac tac cag gag gtc ace ccg atg ate aac tec tgg 1044 
Gly Gly Thr Thr Tyr Tyr Gin Glu Val Thr Pro Met lie Asn Ser Trp 

170 175 180 

ggt gtc agg ate egg ace taa 1065 
Gly Val Arg lie Arg Thr 
185 



<210> 4 
<211> 354 

<212> PRT 

<213> Nocardiopsis dassonvillei subspecies dassonvillei DSM 43235 ("Protease 
18") 

<400> 4 

Ala Pro Ala Pro Val Pro Gin Thr Pro Val Ala Asp Asp Ser Ala 
-165 -160 -155 



Ala Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr Ser 
-150 -145 -140 



Ala Glu Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala lie Glu 
-135 -130 -125 



Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 -110 



Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 



Ala Ser Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val 
-90 -85 -80 -75 



Val Ser His Gly Thr Glu Gly Leu Thr Glu Val Val Glu Asp Leu Asn 
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-70 



-65 



-60 



Gly Ala Glu Val Pro Glu Ser Val Leu Gly Trp Tyr Pro Asp Val Glu 

-55 -50 -45 



Ser Asp Thr Val Val Val Glu Val Leu Glu Gly Ser Asp Ala Asp Val 
-40 -35 -30 • 



Ala Ala Leu Leu Ala Asp Ala Gly Val Asp Ser Ser Ser Val Arg Val 
-25 -20 -15 



Glu Glu Ala Glu Glu Ala Pro Gin Val Tyr Ala Asp He He Gly Gly 
-10 -5 -11 5 



Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 

10 15 20 



Asn Ser Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr 
25 30 35 



Val Gly Thr Gly Val Thr He Gly Asn Gly Thr Gly Thr Phe Gin Asn 
40 45 50 



Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn 
55 60 65 70 



Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Gin 

75 80 85 



Ser Val Thr Gly Thr Ser Gin Ala Pro Ala Gly Ser Ala Val Cys Arg 

90 95 100 



Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn 
105 110 115 



Gin Thr Val Arg Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr 
120 125 130 



Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe He Ser Gly 
135 140 145 150 



Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Val 

155 160 165 
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Gly Gly Thr Thr Tyr Tyr Gin Glu Val Thr Pro Met lie Asn Ser Trp 

170 175 180 



Gly Val Arg lie Arg Thr 
185 



<210> 5 

<211> 1062 

<212> DNA 

<213> Nocardiopsis prasina DSM 15648 ("Protease 11") 



<220> 

<221> CDS 

<222> (1)..(1059) 

<220> 

<221> mat_peptide 
<222> (496) . . (1059) 

<400> 5 

gcc acc gga ccg etc ccc cag tea ccc acc ccg gag gcc gac gcc 45 
Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 



gtc tec atg cag gag gcg etc cag cgc gac etc ggc ctg acc ccg 90 

Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 

-150 -145 -140 

ctt gag gcc gat gaa ctg ctg gcc gcc cag gac acc gcc ttc gag 135 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 

-135 -130 -125 

gtc gac gag gcc gcg gcc gcg gee gee ggg gac gee tac ggc ggc 180 

Val Asp Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly 

-120 -115 -110 

tec gtc ttc gac acc gag acc ctg gaa ctg acc gtc ctg gtc acc gac 228 

Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 

-105 -100 -95 -90 

gcc gcc teg gtc gag get gtg gag gcc acc ggc gcg ggt acc gaa etc 276 
Ala Ala Ser Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 

gtc tec tac ggc ate gag ggc etc gac gag ate ate cag gat etc aac 324 
Val Ser Tyr Gly lie Glu Gly Leu Asp Glu lie lie Gin Asp Leu Asn 

-70 -65 -60 

gcc gcc gac gcc gtc ccc ggc gtg gtc ggc tgg tac ccg gac gtg gcg 372 
Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 
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ggt gac acc gtc gtc ctg gag gtc ctg gag ggt tec gga gcc gac gtg 420 

Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 

-40 -35 -30 

age ggc ctg etc gee gac gee ggc gtg gac gee teg gcc gtc gag gtg 468 

Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 

-25 -20 -15 -10 

acc age agt geg eag eee gag etc tac gee gac ate ate ggc ggt ctg 516 

Thr Ser Ser Ala Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 

gcc tac acc atg ggc ggc cgc tgt teg gtc gga tte gcg gcc acc aac 564 

Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 

10 15 20 

gcc gcc ggt eag ccc gga ttc gtc acc gcc ggt cac tgt ggc cgc gtg 612 

Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 

25 30 35 

ggc acc eag gtg age ate ggc aac ggc cag ggc gtc ttc gag eag tec 660 

Gly Thr Gin Val Ser lie Gly Asn Gly Gin Gly Val Phe Glu Gin Ser 

40 45 50 55 

ate ttc ecg ggc aac gac gcc gcc ttc gtc cgc ggc acg tec aac ttc 708 

lie Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 

60 65 70 

acg ctg acc aac ctg gtc age cgc tac aac acc ggc ggt tac gcc acc 756 

Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 

gtc gee ggc cac aac cag geg eee ate ggc tec tec gtc tge cgc tee 804 

Val Ala Gly His Asn Gin Ala Pro lie Gly Ser Ser Val Cys Arg Ser 

90 95 100 

ggc tee ace acc ggc tgg cac tge ggc acc ate cag gcc cgc ggc cag 852 

Gly Ser Thr Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Gly Gin 

105 110 115 

teg gtg age tac ccc gag ggc ace gtc ace aac atg acc egg ace ace 900 

Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 

120 125 130 135 

gtg tge gee gag ccc ggc gac tee ggc ggc tec tac ate tec ggc aac 94 8 

Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Asn 

140 145 150 

cag gcc eag ggc gtc acc tee ggc ggc tec ggc aac tge cgc ace ggc 996 

Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

155 160 165 

ggg ace acc ttc tac cag gag gtc ace ccc atg gtg aac tee tgg ggc 1044 

Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 

170 175 180 

gtc egt etc egg acc taa 1062 
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Val Arg Leu Arg Thr 
185 



<210> 6 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina DSM 15648 ("Protease 11") 

<400> 6 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 



Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 



Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 



Val Asp Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 



Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 



Ala Ala Ser Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 



Val Ser Tyr Gly lie Glu Gly Leu Asp Glu lie lie Gin Asp Leu Asn 

-70 -65 -60 



Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 



Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 



Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 



Thr Ser Ser Ala Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 



Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 
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Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 
25 30 35 



Gly Thr Gin Val Ser He Gly Asn Gly Gin Gly Val Phe Glu Gin Ser 

40 45 50 55 



lie Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 

60 65 70 



Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 



Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg Ser 
90 95 100 



Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 



Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 



Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Asn 

140 145 150 



Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

155 160 165 



Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 



Val Arg Leu Arg Thr 
185 



<210> 7 

<211> 1062 

<212> DNA 

<213> Nocardiopsis prasina DSM 15649 ("Protease 35") 



<220> 

<221> CDS 

<222> (1)..(1059) 

<220> 
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<221> mat_peptide 
<222> (496) . . (1059) 

<400> 7 

gcc acc gga cca etc ccc cag tea ccc acc ccg gag gcc gac gcc 45 
Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

gtc tec atg cag gag gcg etc cag cgc gac etc gge ctg ace ccg 90 
Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 

ett gag gcc gat gaa ctg ctg gcc gcc cag gac acc gcc ttc gag 135 
Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 

gtc gac gag gcc gcg gcc gag gcc gcc ggt gac gcc tac ggc ggc 180 
Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 

tec gtc ttc gac acc gag acc ctg gaa ctg acc gtc ctg gtc acc gac 228 
Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 

tec gcc gcg gtc gag gcg gtg gag gcc acc ggc gcc ggg acc gaa ctg 27 6 

Ser Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 

gtc tec tac ggc ate acg ggc etc gac gag ate gtc gag gag etc aae 324 
Val Ser Tyr Gly lie Thr Gly Leu Asp Glu lie Val Glu Glu Leu Asn 

-70 -65 -60 

gee gee gac gee gtt ccc gge gtg gtc gge tgg tac ccg gac gtc gcg 372 
Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 

ggt gac ace gtc gtg ctg gag gtc ctg gag ggt tec ggc gee gac gtg 420 
Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 

gge gge ctg etc gee gac gee gge gtg gac gee teg gcg gtc gag gtg 4 68 

Gly Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 

acc ace acc gag cag ccc gag ctg tac gee gac ate ate gge ggt ctg 516 
Thr Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 

gee tac ace atg ggc ggc cgc tgt teg gtc gge ttc gcg gee acc aae 564 
Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 

10 15 20 

gee gee ggt cag ccc ggg ttc gtc acc gee ggt eac tgt gge cgc gtg 612 
Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 
25 30 35 

gge acc cag gtg ace ate gge aae gge egg gge gtc ttc gag cag tee 660 
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Gly Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin Ser 

40 45 50 55 

ate ttc ccg ggc aac gac gcc gcc ttc gtc cgc gga acg tec aac ttc 708 

He Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 

60 65 70 

acg ctg acc aac ctg gtc age cgc tac aac acc ggc ggc tac gcc acc 756 

Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 

gtc gcc ggt cac aac cag gcg ccc ate ggc tec tec gtc tgc cgc tec 804 

Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg Ser 
90 95 100 

ggc tee acc ace ggt tgg cac tgc ggc ace ate cag gee cgc ggc cag 852 

Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 

teg gtg age tac ccc gag ggc ace gtc ace aac atg acg egg ace ace 900 

Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 

120 125 130 135 

gtg tgc gee gag ccc ggc gac tee ggc ggc tee tac ate tee ggc aac 948 

Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Asn 

140 145 150 

cag gee cag ggc gtc ace tee ggc ggc tee ggc aac tgc cgc ace ggc 996 

Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

155 160 165 

ggg ace ace ttc tac cag gag gtc ace ccc atg gtg aac tee tgg ggc 1044 

Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 

gtc egt etc egg acc taa 1062 
Val Arg Leu Arg Thr 
185 



<210> 8 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina DSM 15649 ("Protease 35") 

<400> 8 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 



Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 



Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 
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Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly 
-120 -115 -110 



Ser Val Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp 
-105 -100 -95 -90 



Ser Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 

-85 -80 -75 



Val Ser Tyr Gly lie Thr Gly Leu Asp Glu lie Val Glu Glu Leu Asn 

-70 -65 -60 



Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 -45 



Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val 
-40 -35 -30 



Gly Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val 
-25 -20 -15 -10 



Thr Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

-5 -11 5 



Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn 
10 15 20 



Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg Val 
25 30 35 



Gly Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin Ser 
40 45 50 55 



He Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 

60 65 70 



Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 

75 80 85 



Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg Ser 
90 95 100 
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Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 



Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 



Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly Asn 

140 145 150 



Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly 

155 160 165 



Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly 
170 175 180 



Val Arg Leu Arg Thr 
185 



<210> 9 

<211> 1068 

<212> DNA 

<213> Nocardiopsis alba DSM 15647 ("Protease 08") 



<220> 

<221> CDS 

<222> (1) . . (1065) 

<220> 

<221> mat^peptide 
<222> (502) . . (1065) 

<400> 9 

gcg acc ggc ccc etc ccc cag tec ecc ace ccg gat gaa gee gag 45 
Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Asp Glu Ala Glu 
-165 -160 -155 

gee acc acc atg gtc gag gee etc cag cgc gae etc ggc ctg tec 90 
Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser 
-150 -145 -140 

ccc tct cag gee gae gag etc etc gag gcg cag gee gag tec tte 135 
Pro Ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu Ser Phe 
-135 -130 -125 

gag ate gae gag gee gee acc gcg gee gea gee gae tec tac ggc 180 
Glu He Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp Ser Tyr Gly 
-120 -115 -110 

ggc tee ate tte gae acc gae age etc ace ctg ace gtc ctg gtc ace 228 
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Gly Ser lie Phe Asp Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr 
-105 -100 -95 



gac gcc tec gcc gtc gag gcg gtc gag gcc gcc ggc gcc gag gcc aag 27 6 

Asp Ala Ser Ala Val Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys 

-90 -85 -80 

gtg gtc teg cac ggc atg gag ggc ctg gag gag ate gtc gcc gac ctg 324 

Val Val Ser His Gly Met Glu Gly Leu Glu Glu He Val Ala Asp Leu 

-75 -70 -65 -60 

aac gcg gcc gac get cag cec ggc gtc gtg ggc tgg tac ccc gac ate 372 

Asn Ala Ala Asp Ala Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He 

-55 -50 -45 

cac tec gac aeg gtc gtc etc gag gtc etc gag ggc tec ggt gee gac 420 

His Ser Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp 

-40 -35 -30 

gtg gac tec ctg etc gcc gac gee ggt gtg gac ace gcc gac gtc aag 4 68 

Val Asp Ser Leu Leu Ala Asp Ala Gly Val Asp Thr Ala Asp Val Lys 

-25 -20 -15 

gtg gag age ace acc gag cag cec gag ctg tac gee gac ate ate ggc 516 

Val Glu Ser Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie He Gly 

-10 -5 -11 5 

ggt etc gcc tac acc atg ggt ggg ege tge teg gtc ggc ttc gcg gee 564 

Gly Leu Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala 

10 15 20 

acc aac gee tec ggc cag cec ggg ttc gtc acc gcc ggc cac tge ggc 612 

Thr Asn Ala Ser Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly 

25 30 35 

acc gtc ggc ace ecg gtc age ate ggc aac ggc cag ggc gtc ttc gag 660 

Thr Val Gly Thr Pro Val Ser He Gly Asn Gly Gin Gly Val Phe Glu 

40 45 50 

cgt tec gtc ttc cec ggc aac gac tee gee ttc gtc ege ggc ace teg 708 

Arg Ser Val Phe Pro Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser 

55 60 65 

aac ttc acc ctg ace aac ctg gtc age ege tac aac acc ggt ggt tac 756 

Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr 

70 75 80 85 

gcg acc gtc tee ggc tee teg cag gcg gcg ate ggc teg cag ate tge 804 

Ala Thr Val Ser Gly Ser Ser Gin Ala Ala He Gly Ser Gin He Cys 

90 95 100 

cgt tec ggc tec acc acc ggc tgg cac tge ggc acc gtc cag gee ege 852 

Arg Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr Val Gin Ala Arg 

105 110 115 

ggc cag aeg gtg age tac ccc cag ggc acc gtg cag aac ctg acc cgc 900 

Gly Gin Thr Val Ser Tyr Pro Gin Gly Thr Val Gin Asn Leu Thr Arg 
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120 



125 



130 



acc aac gtc tgc gcc gag ccc ggt gac tec ggc ggc tec ttc ate tec 948 

Thr Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe lie Ser 
135 140 145 

ggc age cag gee cag ggc gtc acc tec ggt ggc tec ggc aac tgc tec 996 

Gly Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser 

150 155 160 165 

ttc ggt ggc ace acc tac tac cag gag gtc aac eeg atg ctg age age 1044 

Phe Gly Gly Thr Thr Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser 

170 175 180 

tgg ggt ctg acc ctg cge ace tga 1068 

Trp Gly Leu Thr Leu Arg Thr 

185 



<210> 10 
<211> 355 
<212> PRT 

<213> Nocardiopsis alba DSM 15647 ("Protease 08") 
<400> 10 

Ala Thr Gly Pro Leu Pro Gin Ser Pro Thr Pro Asp Glu Ala Glu 
-165 -160 -155 



Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser 
-150 -145 -140 



Pro Ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu Ser Phe 
-135 -130 -125 



Glu lie Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp Ser Tyr Gly 
-120 -115 -110 



Gly Ser lie Phe Asp Thr Asp Ser Leu Thr Leu Thr Val Leu Val Thr 
-105 -100 -95 



Asp Ala Ser Ala Val Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys 
-90 -85 -80 



Val Val Ser His Gly Met Glu Gly Leu Glu Glu lie Val Ala Asp Leu 
-75 -70 -65 -60 



Asn Ala Ala Asp Ala Gin Pro Gly Val Val Gly Trp Tyr Pro Asp lie 

-55 -50 -45 
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His Ser Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp 

-40 -35 -30 



Val Asp Ser Leu Leu Ala Asp Ala Gly Val Asp Thr Ala Asp Val Lys 
-25 -20 -15 



Val Glu Ser Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie lie Gly 
-10 -5 -11 5 



Gly Leu Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala 

10 15 20 



Thr Asn Ala Ser Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly 

25 30 35 



Thr Val Gly Thr Pro Val Ser He Gly Asn Gly Gin Gly Val Phe Glu 
40 45 50 



Arg Ser Val Phe Pro Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser 

55 60 65 



Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr 

70 75 80 85 



Ala Thr Val Ser Gly Ser Ser Gin Ala Ala He Gly Ser Gin He Cys 

90 95 100 



Arg Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr Val Gin Ala Arg 

105 110 115 



Gly Gin Thr Val Ser Tyr Pro Gin Gly Thr Val Gin Asn Leu Thr Arg 

120 125 130 



Thr Asn Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Phe He Ser 

135 140 145 



Gly Ser Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser 

150 155 160 165 



Phe Gly Gly Thr Thr Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser 

170 175 180 
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Trp Gly Leu Thr Leu Arg Thr 

185 



<210> 11 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 11 

ccgattatgg agcggattga acatgcg 27 



<210> 12 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 12 

gtgaccatcg gcgacggcag gggcgtcttc g 31 

<210> 13 

<211> 10172 

<212> DNA 

<213> Artificial 

<220> 

<223> Expression Cassette 



<220> 

<221> source 

<222> (1) . . (3323) 

<223> Bacillus subtilis genome sequence including 
yfmH-yfmD-yfmC-yfmB-yfmA genes 

<220> 

<221> misc_recomb 

<222> (3561) . . (4208) 

<223> Cat gene providing chloramphenicol resistance 
<220> 

<221> promoter 

<222> (4523) . . (5633) 

<223> Triple PamyL-scBAN-CrylllA promoter including mRNA stabilizing 
sequence 

<220> 

<221> sig_peptide 

<222> (5658) . . (5738) 
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<220> 

<221> CDS 

<222> (5658) . . (6797) 
<220> 

<221> mat_peptide 

<222> (6234) . . (6797) 

<220> 

<221> source 

<222> (6839) . . (7540) 

<223> Part of Bacillus subtilis pectate lyase gene 
<220> 

<221> source 

<222> (7541) . . (10172) 

<223> Bacillus subtilis genome DNA inclding yflS-citM genes 

<400> 13 



gagtatcgcc 


agtaaggggc 


gtttttgttt 


tctggttgtt 


ttcttcattt 


caggtttcgc 


60 


cctttccttg 


ccaaatataa 


gaaaaacggc 


gttccgataa 


tcgcggtgac 


aatgccgacc 


120 


ggtgattcat 


aaggaaatgc 


aatccatctg 


gccagaacat 


ctgcgtacac 


cagcaaaatg 


180 


gcaccgaaca 


gtgccgaaaa 


cggaagcacg 


tattgataat 


gttctccgat 


cagcttgcgg 


240 


acaatatgcg 


ggacgagcag 


cccgacaaag 


ccaatcggcc 


cggcgacggc 


tacggaagcg 


300 


ccggaaagaa 


ttaaaataat 


caaactgatc 


agaatcctga 


tgccgttcat 


attttgtcca 


360 


agcccttttg 


ctgtttcgtc 


tccgagaccg 


agaacagaaa 


cagaaccgga 


aaatacgagg 


420 


gcaagcccga 


tgccaatgac 


agaaaaagga 


gcgatggtta 


tgacgtcctg 


ccagttgctg 


480 


ccgtcgattg 


cgcctgtcat 


ccagtacaga 


acatcctcac 


ctgactcatt 


taaaataatg 


540 


atggcctgtg 


tcatagagga 


gaggaacaag 


tgcacggcca 


ttcctgacag 


cgccagcttg 


600 


acaggcgtca 


ttccgccgga 


tgaggcaatc 


atatacacaa 


tcgcgccgcc 


tgctgccgca 


660 


cccgcaaaag 


cgaatataac 


agatgaatag 


ggcgatgccg 


gcagaatgac 


gagagaagca 


720 


acaacaaaaa 


gcgatgcacc 


cgcattcaca 


ccgaaaattt 


ggggtgaagc 


cagaggattt 


780 


ctggtcatag 


cctgcatcag 


cgcccctgct 


acagctaggc 


tggcgccgac 


aaaaacgccg 


840 


attaatgtgc 


ggggaaggcg 


aagagtagag 


atgatgagct 


gttcctttga 


accgtcccat 


900 


acaaaaagat 


atttcaatga 


atctatgatg 


ctgatgtctg 


aggctcctac 


tgaaagattc 


960 


agcccaagcc 


caaatataaa 


aataatcagt 


gcaatgataa 


acatcatcag 


tcttgatgat 


1020 


gagcgccgtt 


tggctgaatg 


atacaacagt 


ctcacttcct 


tactgcgtct 


ggttgcaaaa 


1080 


acgaagaagc 


aaggattccc 


ctcgcttctc 


atttgtccta 


tttattatac 


acttttttaa 


1140 
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gcacatcttt 


ggcgcttgtt 


tcactagact 


tgatgcctct 


gaatcttgtc 


caagtgtcac 


1200 


ggtccgcatc 


atagacttgt 


ccatttttca 


ccgctttgag 


atttttccag 


agcgggttcg 


1260 


ttttccactc 


atctacaatg 


gttttgcctt 


cgttggctga 


gatgaacaaa 


atatcaggat 


1320 


cgattttgct 


caattgctca 


aggctgacct 


cttgataggc 


gttatctgac 


ttcacagcgt 


1380 


gtgtaaagcc 


tagcatttta 


aagatttctc 


cgtcatagga 


tgatgatgta 


tgaagctgga 


1440 


aggaatccgc 


tcttgcaacg 


ccgagaacga 


tgttgcggtt 


ttcatctttc 


ggaagttcgg 


1500 


cttttagatc 


gttgatgact 


tttttgtgct 


cggcaagctt 


ttcttttcct 


tcatcttctt 


1560 


tatttaatgc 


tttagcaatg 


gtcgtaaagc 


tgtcgatcgt 


ttcgtcatat 


gtcgcttcac 


1620 


ggctttttaa 


ttcaatcgtc 


ggggcgattt 


ttttcagctg 


tttataaatg 


tttttatggc 


1680 


gctcagcgtc 


agcgatgatt 


aaatcaggct 


tcaaggaact 


gatgacctca 


agattgggtt 


1740 


cgctgcgtgt 


gcctacagat 


gtgtaatcaa 


tggagctgcc 


gacaagcttt 


ttaatcatat 


1800 


cttttttgtt 


gtcatctgcg 


atgcccaccg 


gcgtaatgcc 


gagattgtga 


acggcatcca 


1860 


agaatgaaag 


ctcaagcaca 


accacccgct 


taggtgtgcc 


gcttactgtc 


gtttttcctt 


1920 


cttcgtcatg 


gatcactctg 


gaatccttag 


actcgctttt 


gccgcttccg 


ttgttattct 


1980 


ggcttgatga 


acagccggat 


acaatgaggc 


aggcgagcaa 


taaaacactc 


atgatggcaa 


2040 


tcaacttgtt 


agaataggtg 


cgcatgtcat 


tcttcctttt 


ttcagattta 


gtaatgagaa 


2100 


tcattatcac 


atgtaacact 


ataatagcat 


ggcttatcat 


gtcaatattt 


ttttagtaaa 


2160 


gaaagctgcg 


tttttactgc 


tttctcatga 


aagcatcatc 


agacacaaat 


aagtggtatg 


2220 


cagcgttacc 


gtgtcttcga 


gacaaaaacg 


catgggcgtt 


ggctttagag 


gtttcgaaca 


2280 


tatcagcagt 


gacataagga 


aggagagtgc 


tgagataacc 


ggacaatttc 


ttttctattt 


2340 


catctgttag 


tgcaaattca 


atgtcgccga 


tattcatgat 


aatcgagaaa 


acaaagtcga 


2400 


tatcgatatg 


aaaatgttcc 


tcggcaaaaa 


ccgcaagctc 


gtgaattcct 


ggtgaacatc 


2460 


cggcacgctt 


atggaaaatc 


tgtttgacta 


aatcactcac 


aatccaagca 


ttgtattgct 


2520 


gttctggtga 


aaagtattgc 


attagacata 


cctcctgctc 


gtacggataa 


aggcagcgtt 


2580 


tcatggtcgt 


gtgctccgtg 


cagcggcttc 


tccttaattt 


tgatttttct 


gaaaataggt 


2640 


cccgttccta 


tcactttacc 


atggacggaa 


aacaaatagc 


tactaccatt 


cctcctgttt 


2700 


ttctcttcaa 


tgttctggaa 


tctgtttcag 


gtacagacga 


tcgggtatga 


aagaaatata 


2760 


gaaaacatga 


aggaggaata 


tcgacatgaa 


accagttgta 


aaagagtata 


caaatgacga 


2820 
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acagctcatg aaagatgtag aggaattgca gaaaatgggt gttgcgaaag aggatgtata 2880 

cgtcttagct cacgacgatg acagaacgga acgcctggct gacaacacga acgccaacac 2940 

gatcggagcc aaagaaacag gtttcaagca cgcggtggga aatatcttca ataaaaaagg 3000 

agacgagctc cgcaataaaa ttcacgaaat cggtttttct gaagatgaag ccgctcaatt 3060 

tgaaaaacgc ttagatgaag gaaaagtgct tctctttgtg acagataacg aaaaagtgaa 3120 

agcttgggca taaagcaagg aaaaaaccaa aaggccaatg tcggcctttt ggtttttttg 3180 

cggtctttgc ggtgggattt tgcagaatgc cgcaatagga tagcggaaca ttttcggttc 3240 

tgaatgtccc tcaatttgct attatatttt tgtgataaat tggaataaaa tctcacaaaa 3300 

tagaaaatgg gggtacatag tggccatcat ggccagctag catgcacatg ggatctggga 3360 

ccaataataa tgactagaga agaaagaatg aagattgttc atgaaattaa ggaacgaata 3420 

ttggataaag tggggtattt ttaaaatata tatttatgtt acagtaatat tgacttttaa 3480 

aaaaggattg attctaagaa gaaagcagac aagtaagcct cctaaattca ctttagataa 3540 

aaatttagga ggcatatcaa atgaacttta ataaaattga tttagacaat tggaagagaa 3600 

aagagatatt taatcattat ttgaaccaac aaacgacttt tagtataacc acagaaattg 3660 

atattagtgt tttataccga aacataaaac aagaaggata taaattttac cctgcattta 3720 

ttttcttagt gacaagggtg ataaactcaa atacagcttt tagaactggt tacaatagcg 3780 

acggagagtt aggttattgg gataagttag agccacttta tacaattttt gatggtgtat 3840 

ctaaaacatt ctctggtatt tggactcctg taaagaatga cttcaaagag ttttatgatt 3900 

tatacctttc tgatgtagag aaatataatg gttcggggaa attgtttccc aaaacaccta 3960 

tacctgaaaa tgctttttct ctttctatta ttccatggac ttcatttact gggtttaact 4020 

taaatatcaa taataatagt aattaccttc tacccattat tacagcagga aaattcatta 4080 

ataaaggtaa ttcaatatat ttaccgctat ctttacaggt acatcattct gtttgtgatg 4140 

gttatcatgc aggattgttt atgaactcta ttcaggaatt gtcagatagg cctaatgact 4200 

ggcttttata atatgagata atgccgactg tactttttac agtcggtttt ctaacgatac 4260 

attaataggt acgaaaaagc aacttttttt gcgcttaaaa ccagtcatac caataactta 4320 

agggtaacta gcctcgccgg aaagagcgaa aatgcctcac atttgtgcca cctaaaaagg 4380 

agcgatttac atatgagtta tgcagtttgt agaatgcaaa aagtgaaatc agctggacta 4440 

aaaggcatgg catgccttcg atagtttatt aatattagtg gagctcagtg agagcgaagc 4500 

gaacacttga ttttttaatt ttctatcttt tataggtcat tagagtatac ttatttgtcc 4560 
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4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 

aaaggagagg ataaaga atg aag aaa ccg ttg ggg aaa att gtc gca age 5690 

Met Lys Lys Pro Leu Gly Lys lie Val Ala Ser 
-190 -185 

acc gca eta ete att tct gtt get ttt agt tea teg ate gca teg 5735 
Thr Ala Leu Leu lie Ser Val Ala Phe Ser Ser Ser lie Ala Ser 
-180 -175 -170 

get gee ace gga geg ete ecc cag tea ecc ace ccg gag gee gac 5780 
Ala Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp 
-165 -160 -155 

gcg gtc tee atg cag gag gcg ete cag ege gac etc gac etg ace 5825 
Ala Val Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr 
-150 -145 -140 

tec gee gag gcc gag gag ctg etg gee gee cag gac acc gee ttc 5870 
Ser Ala Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe 
-135 -130 -125 



tataaactat 


ttagcagcat 


aatagattta 


ttgaataggt 


catttaagtt 


gagcatatta 


ggggaggaaa 


atettggaga 


aatatttgaa 


gaacccgagg 


atetagatea 


ggtaeegeaa 


cgttcgcaga 


tgctgctgaa 


gagattatta 


aaaagctgaa 


ageaaaaggc 


tatcaattgg 


taactgtatc 


tcagcttgaa 


gaagtgaaga 


agcagagagg 


ctattgaata 


aatgagtaga 


aagegceata 


teggegcttt 


tcttttggaa 


gaaaatatag 


ggaaaatggt 


acttgttaaa 


aattcggaat 


atttatacaa 


tatcatatgt 


atcacattga 


aaggaggggc 


etgctgtcca 


gaetgtccge 


tgtgtaaaaa 


aaaggaataa 


aggggggttg 


aeattatttt 


actgatatgt 


ataatataat 


ttgtataaga 


aaatggaggg 


gcectegaaa 


cgtaagatga 


aaccttagat 


aaaagtgett 


tttttgttge 


aattgaagaa 


ttattaatgt 


taagettaat 


taaagataat 


atctttgaat 


tgtaacgccc 


ctcaaaagta 


agaactacaa 


aaaaagaata 


cgttatatag 


aaatatgttt 


gaaecttctt 


cagattaeaa 


atatattegg 


aeggactcta 


cetcaaatge 


ttatctaact 


atagaatgac 


atacaagcac 


aaccttgaaa 


atttgaaaat 


ataactacca 


atgaacttgt 


tcatgtgaat 


tatcgetgta 


tttaattttc 


teaatteaat 


atataatatg 


ccaataeatt 


gttacaagta 


gaaattaaga 


cacecttgat 


agccttacta 


tacctaacat 


gatgtagtat 


taaatgaata 


tgtaaatata 


tttatgataa 


gaagcgactt 


atttataatc 


attacatatt 


tttctattgg 


aatgattaag 


atteeaatag 


aatagtgtat 


aaattattta 


tettgaaagg 


agggatgeet 


aaaaacgaag 


aacattaaaa 


acatatattt 


gcacegtcta 


atggatttat 


gaaaaatcat 


tttateagtt 


tgaaaattat 


gtattatgga 


gctetgaaaa 




gag gtc gac gag gcc gcg gcc gag gcc gcc ggg gac gcc tac ggc 5915 
Glu Val Asp Glu Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly 
-120 -115 -110 



ggc tec gtc ttc gac acc gag age ctg gaa ctg acc gtc ctg gtc acc 5963 
Gly Ser Val Phe Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr 
-105 -100 -95 

gat gcc gcc gcg gtc gag gcc gtg gag gcc acc ggc gcc ggg acc gag 6011 
Asp Ala Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu 
-90 -85 -80 -75 

ctg gtc tec tac ggc ate gac ggt etc gac gag ate gtc cag gag etc 6059 
Leu Val Ser Tyr Gly lie Asp Gly Leu Asp Glu lie Val Gin Glu Leu 

-70 -65 -60 

aac gee gee gac gcc gtt ccc ggt gtg gtc ggc tgg tac ccg gac gtg 6107 
Asn Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val 

-55 -50 -45 

gcg ggt gac acc gtc gtc ctg gag gtc ctg gag ggt tec gga gcc gac 6155 
Ala Gly Asp Thr Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp 
-40 -35 -30 

gtc age ggc ctg etc gcg gac gcc ggc gtg gac gee teg gcc gtc gag 6203 
Val Ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu 
-25 -20 -15 

gtg ace acg age gac eag ccc gag etc tac gee gae ate ate ggt ggt 6251 
Val Thr Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly 
-10 -5 -11 5 

ctg gee tac ace atg ggc ggc egc tgt teg gtc ggc ttc gcg gee ace 6299 
Leu Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 

10 15 20 

aac gee gee ggt eag ccc ggg ttc gtc ace gee ggt cac tge ggc egc 6347 
Asn Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg 
25 30 35 

gtg ggc ace eag gtg ace ate ggc aac ggc agg ggc gtc tte gag eag 6395 
Val Gly Thr Gin Val Thr He Gly Asn Gly Arg Gly Val Phe Glu Gin 

40 45 50 

tee gtc ttc ccc ggc aac gae gcg gee tte gtc egc ggt acg tee aac 6443 
Ser Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn 
55 60 65 70 

tte acg ctg acc aac etg gtc age egc tac aac ace ggc ggg tae gee 64 91 
Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr Ala 

75 80 85 

acg gtc gee ggt eac aac eag gee eee ate ggc tec tee gtc tge egc 6539 
Thr Val Ala Gly His Asn Gin Ala Pro He Gly Ser Ser Val Cys Arg 

90 95 100 

tee ggc tee ace ace ggt tgg cac tge ggc ace ate eag gee egc ggc 6587 
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Ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly 
105 110 115 

cag teg gtg age tac ccc gag ggc acc gtc acc aac atg acc egg ace 6635 
Gin Ser Val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr 
120 125 130 

acc gtg tgc gee gag ccc ggc gac tec ggc ggc tee tac ate tec ggc 6683 
Thr Val Cys Ala Glu Pro Gly Asp Ser Gly Gly Ser Tyr He Ser Gly 
135 140 145 150 

acc cag gcc cag ggc gtg acc tec ggc ggc tec ggc aac tgc egc acc 6731 
Thr Gin Ala Gin Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr 

155 160 165 

ggc ggg acc acc ttc tac cag gag gtc acc ccc atg gtg aac tec tgg 6779 
Gly Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp 

170 175 180 

ggc gtc egt etc egg acc taatcgeatg tteaateege teeataateg 6827 
Gly Val Arg Leu Arg Thr 
185 



gtcgacgcgg 


eggttegcgt 


ccggaeagea 


cateacegaa 


atattatgga 


agaaaatatc 


6887 


agcaccatga 


cggccaaaeg 


gatgcttcca 


acggtgetaa 


etatatcacg 


atgteetaea 


6947 


actattatca 


cgatcatgat 


aaaagctcca 


ttttcggate 


aagtgacagc 


aaaacctccg 


7007 


atgaeggcaa 


attaaaaatt 


acgctgcatc 


ataaccgcta 


taaaaatatt 


gtecagcgcg 


7067 


egcegagagt 


ccgcttcggg 


caagtgcacg 


tatacaacaa 


ctattatgaa 


ggaagcacaa 


7127 


getcttcaag 


ttatcctttt 


agetatgcat 


ggggaatcgg 


aaagtcatct 


aaaatctatg 


7187 


cecaaaaeaa 


tgtcattgac 


gtaeegggae 


tgtcagctge 


taaaacgate 


agegtattca 


7247 


gcgggggaac 


ggctttatat 


gactccggca 


cgttgctgaa 


cggcaeacag 


atcaacgcat 


7307 


cggctgeaaa 


cgggctgage 


tcttctgtcg 


gctggacgec 


gtctctgcat 


ggatcgattg 


7367 


atgcttctgc 


taatgtgaaa 


teaaatgtta 


taaatcaagc 


gggtgcgggt 


aaattaaatt 


7427 


aagaaagtga 


aaaaeacaaa 


gggtgetaac 


ctttgtgttt 


tttaattaat 


taaaatgttt 


7487 


attaaettag 


ttaaggagta 


gaatggaaaa 


ggggatcgga 


aaacaagtat 


ataggaggag 


7547 


acctatttat 


ggcttcagaa 


aaagacgcag 


gaaaacagtc 


agcagtaaag 


cttgttccat 


7607 


tgcttattac 


tgtcgctgtg 


ggactaatca 


tctggtttat 


tcccgctccg 


tccggacttg 


7667 


aaeetaaagc 


ttggeatttg 


tttgegattt 


ttgtegeaac 


aattatcggc 


tttatctcca 


7727 


agcccttgcc 


aatgggtgea 


attgeaattt 


ttgcattggc 


ggttactgca 


ctaactggaa 


7787 


cactatcaat 


tgaggataca 


ttaageggat 


tcgggaataa 


gaeeatttgg 


cttatcgtta 


7847 




tcgcattctt tatttcccgg ggatttatca aaaccggtct cggtgcgaga atttcgtatg 7907 

tattcgttca gaaattcgga aaaaaaaccc ttggactttc ttattcactg ctattcagtg 7967 

atttaatact ttcacctgct attccaagta atacggcgcg tgcaggaggc attatatttc 8027 

ctattatcag atcattatcc gaaacattcg gatcaagccc ggcaaatgga acagagagaa 8087 

aaatcggtgc attcttatta aaaaccggtt ttcaggggaa tctgatcaca tctgctatgt 8147 

tcctgacagc gatggcggcg aacccgctga ttgccaagct ggcccatgat gtcgcagggg 8207 

tggacttaac atggacaagc tgggcaattg ccgcgattgt accgggactt gtaagcttaa 8267 

tcatcacgcc gcttgtgatt tacaaactgt atccgccgga aatcaaagaa acaccggatg 8327 

cggcgaaaat cgcaacagaa aaactgaaag aaatgggacc gttcaaaaaa tcggagcttt 8387 

ccatggttat cgtgtttctt ttggtgcttg tgctgtggat ttttggcggc agcttcaaca 8447 

tcgacgctac cacaaccgca ttgatcggtt tggccgttct cttattatca caagttctga 8507 

cttgggatga tatcaagaaa gaacagggcg cttgggatac gctcacttgg tttgcggcgc 8567 

ttgtcatgct cgccaacttc ttgaatgaat taggcatggt gtcttggttc agtaatgcca 8627 

tgaaatcatc cgtatcaggg ttctcttgga ttgtggcatt catcatttta attgttgtgt 8687 

attattactc tcactatttc tttgcaagtg cgacagccca catcagtgcg atgtattcag 8747 

catttttggc tgtcgtcgtg gcagcgggcg caccgccgct tttagcagcg ctgagcctcg 8807 

cgttcatcag caacctgttc gggtcaacga ctcactacgg ttctggagcg gctccggtct 8867 

tcttcggagc aggctacatc ccgcaaggca aatggtggtc catcggattt atcctgtcga 8927 

ttgttcatat catcgtatgg cttgtgatcg gcggattatg gtggaaagta ctaggaatat 8987 

ggtagaaaga aaaaggcaga cgcggtctgc ctttttttat tttcactcct tcgtaagaaa 9047 

atggattttg aaaaatgaga aaattccctg tgaaaaatgg tatgatctag gtagaaagga 9107 

cggctggtgc tgtggtgaaa aagcggttcc atttttccct gcaaacaaaa ataatggggc 9167 

tgattgcggc tctgctggtc tttgtcattg gtgtgctgac cattacgtta gccgttcagc 9227 

atacacaggg agaacggaga caggcagagc agctggcggt tcaaacggcg agaaccattt 9287 

cctatatgcc gccggttaaa gagctcattg agagaaaaga cggacatgcg gctcagacgc 9347 

aagaggtcat tgaacaaatg aaagaacaga ctggtgcgtt tgccatttat gttttgaacg 9407 

aaaaaggaga cattcgcagc gcctctggaa aaagcggatt aaagaaactg gagcgcagca 94 67 

gagaaatttt gtttggcggt tcgcatgttt ctgaaacaaa agcggatgga cgaagagtga 9527 

tcagagggag cgcgccgatt ataaaagaac agaagggata cagccaagtg atcggcagcg 9587 
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tgtctgttga 


ttttctgcaa 


acggagacag 


agcaaagcat 


caaaaagcat 


ttgagaaatt 


9647 


tgagtgtgat 


tgctgtgctt 


gtactgctgc 


tcggatttat 


tggcgccgcc 


gtgctggcga 


9707 


aaagcatcag 


aaaggatacg 


ctcgggcttg 


aaccgcatga 


gatcgcggct 


ctatatcgtg 


9767 


agaggaacgc 


aatgcttttc 


gcgattcgag 


aagggattat 


tgccaccaat 


cgtgaaggcg 


9827 


tcgtcaccat 


gatgaacgta 


tcggcggccg 


agatgctgaa 


gctgcccgag 


cctgtgatcc 


9887 


atcttcctat 


agatgacgtc 


atgccgggag 


cagggctgat 


gtctgtgctt 


gaaaaaggag 


9947 


aaatgctgcc 


gaaccaggaa 


gtaagcgtca 


acgatcaagt 


gtttattatc 


aatacgaaag 


10007 


tgatgaatca 


aggcgggcag 


gcgtatggga 


ttgtcgtcag 


cttcagggag 


aaaacagagc 


10067 


tgaagaagct 


gatcgacaca 


ttgacagagg 


ttcgcaaata 


ttcagaggat 


ctcagggcgc 


10127 


agactcatga 


attttcaaat 


aagctttatg 


cgattttagg 


gctgc 




10172 



<210> 14 

<211> 380 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 14 

Met Lys Lys Pro Leu Gly Lys lie Val Ala Ser Thr Ala Leu Leu 
-190 -185 -180 



lie Ser Val Ala Phe Ser Ser Ser lie Ala Ser Ala Ala Thr Gly 
-175 -170 -165 



Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val Ser Met 
-160 -155 -150 



Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu Ala 
-145 -140 -135 



Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 



Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe 
-115 -110 -105 



Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
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-100 



-95 



-90 



Val Glu Ala Val 
-85 



Gly lie Asp Gly 
-70 



Ala Val Pro Gly 



Val Val Leu Glu 

-35 



Leu Ala Asp Ala 
-20 



Asp Gin Pro Glu 
-5 



Met Gly Gly Arg 



Gin Pro Gly Phe 

30 



Val Thr lie Gly 
45 



Gly Asn Asp Ala 
60 



Asn Leu Val Ser 
75 



His Asn Gin Ala 



Thr Gly Trp His 

110 



Tyr Pro Glu Gly 
125 



Glu Ala Thr Gly 
-80 



Leu Asp Glu lie 
-65 



Val Val Gly Trp 
-50 



Val Leu Glu Gly 



Gly Val Asp Ala 

-15 



Leu Tyr Ala Asp 
-1 1 



Cys Ser Val Gly 
15 



Val Thr Ala Gly 



Asn Gly Arg Gly 

50 



Ala Phe Val Arg 
65 



Arg Tyr Asn Thr 
80 



Pro lie Gly Ser 
95 



Cys Gly Thr lie 



Thr Val Thr Asn 

130 



Ala Gly Thr Glu 

-75 



Val Gin Glu Leu 
-60 



Tyr Pro Asp Val 
-45 



Ser Gly Ala Asp 
-30 



Ser Ala Val Glu 



He He Gly Gly 
5 



Phe Ala Ala Thr 
20 



His Cys Gly Arg 
35 



Val Phe Glu Gin 



Gly Thr Ser Asn 

70 



Gly Gly Tyr Ala 
85 



Ser Val Cys Arg 
100 



Gin Ala Arg Gly 
115 



Met Thr Arg Thr 
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Leu Val Ser Tyr 



Asn Ala Ala Asp 

-55 



Ala Gly Asp Thr 
-40 



Val Ser Gly Leu 
-25 



Val Thr Thr Ser 
-10 



Leu Ala Tyr Thr 

10 



Asn Ala Ala Gly 
25 



Val Gly Thr Gin 
40 



Ser Val Phe Pro 
55 



Phe Thr Leu Thr 



Thr Val Ala Gly 

90 



Ser Gly Ser Thr 
105 



Gin Ser Val Ser 
120 



Thr Val Cys Ala 
135 



Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Thr Gin Ala Gin 

140 145 150 



Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Arg Thr Gly Gly Thr Thr 

155 160 165 170 



Phe Tyr Gin Glu Val Thr Pro Met Val Asn Ser Trp Gly Val Arg Leu 

175 180 185 



Arg Thr 



<210> 15 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 15 

ggagctctga aaaaaaggag aggataaaga atgaa 

<210> 16 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 16 

gcgttccgat aatcgcggtg acaatgccg 

<210> 17 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 17 

ttcatgagtc tgcgccctga gatcctctg 

<210> 18 

<211> 30 

<212> DNA 



30 



<213> Artificial 



<220> 

<223> Primer 
<400> 18 

taatcgcatg ttcaatccgc tccataatcg 30 



<210> 19 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 19 

cccaacggtt tcttcattct ttatcctctc ctttttttca gage 44 



<210> 20 

<211> 1164 

<212> DNA 

<213> Artificial 

<220> 

<223> Protease 22 



<220> 

<221> CDS 

<222> (1)..(1164) 

<220> 

<221> sig_peptide 
<222> (1) . . (81) 

<220> 

<221> mat_peptide 
<222> (577) . . (1164) 

<400> 20 

atg aaa aaa ccg ctg gga aaa att gtc gca age aca gca ctt ctt 45 
Met Lys Lys Pro Leu Gly Lys lie Val Ala Ser Thr Ala Leu Leu 
-190 -185 -180 

att tea gtg gca ttt age tea tct att gca tea gca get aca gga 90 
lie Ser Val Ala Phe Ser Ser Ser lie Ala Ser Ala Ala Thr Gly 
-175 -170 -165 

gca tta ccg eag tct ccg aca ccg gaa gca gat gca gtc tea atg 135 
Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val Ser Met 
-160 -155 -150 

caa gaa gca ctg caa aga gat ctt gat ctt aca tea gca gaa gca 180 
Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu Ala 
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-145 



-140 



-135 



gaa gaa ctt ctt get gca caa gat aca gca ttt gaa gtg gat gaa 225 

Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
-130 -125 -120 

gca gcg gca gaa gca gca gga gat gca tat ggc ggc tea gtt ttt 270 
Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe 
-115 -110 -105 

gat aca gaa tea ctt gaa ctt aca gtt ctt gtt aca gat gca gca gca 318 
Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
-100 -95 -90 

gtt gaa gca gtt gaa gca aca gga gca gga aca gta ctt gtt tea tat 366 
Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Val Leu Val Ser Tyr 
-85 -80 -75 

gga att gat ggc ctt gat gaa att gtt caa gaa ctg aat gca get gat 414 
Gly lie Asp Gly Leu Asp Glu lie Val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 

get gtt ceg ggc gtt gtt ggc tgg tat ceg gat gtt get gga gat aca 4 62 

Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 

-50 -45 -40 

gtt gtc ctt gaa gtt ctt gaa gga tea ggc gca gat gtt tea ggc ctg 510 
Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Ser Gly Leu 

-35 -30 -25 

ctg gca gae gca gga gtc gat gca tea gca gtt gaa gtt aca aca tea 558 
Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Thr Ser 
-20 -15 -10 

gat caa ceg gaa ctt tat gca gat att att ggc ggc ctg gca tat tat 606 
Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala Tyr Tyr 
-5 -11 5 10 

atg ggc ggc aga tge age gtt ggc ttt gca gca aca aat gca tea ggc 654 
Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn Ala Ser Gly 

15 20 25 

caa ceg ggc ttt gtt aca gca ggc eat tge ggc aca gtt ggc aca cea 702 
Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr Val Gly Thr Pro 

30 35 40 

gtt tea att ggc aat ggc aaa ggc gtt ttt gaa ega age att ttt ceg 750 
Val Ser lie Gly Asn Gly Lys Gly Val Phe Glu Arg Ser lie Phe Pro 
45 50 55 

ggc aat gat tea gca ttt gtt aga ggc aca tea aat ttt aca ctt aca 798 
Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser Asn Phe Thr Leu Thr 
60 65 70 

aat ctg gtt tea aga tat aat tea ggc ggc tat gca aca gtt gca ggc 846 
Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Ala Thr Val Ala Gly 
75 80 85 90 
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cat aat caa gca ccg att ggc tea gca gtt tgc aga tea ggc tea aca 894 
His Asn Gin Ala Pro lie Gly Ser Ala Val Cys Arg Ser Gly Ser Thr 

95 100 105 



aca ggc tgg cat tgc ggc aca att caa gca aga aat caa aca gtt agg 942 
Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Asn Gin Thr Val Arg 

110 115 120 

tat ccg caa ggc aca gtt tat agt ctg aca aga aca aca gtt tgt gca 990 
Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr Thr Val Cys Ala 

125 130 135 

gaa ccg ggc gat tea ggc ggc tea tat att age ggc act caa gca caa 1038 
Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Thr Gin Ala Gin 

140 145 150 

ggc gtt aca tea ggc ggc tea ggc aat tgc agt get ggc ggc aca aca 1086 
Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Ala Gly Gly Thr Thr 
155 160 165 170 

tat tac caa gaa gtt aat ccg atg ett agt tea tgg ggc ctt aca ctt 1134 
Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser Trp Gly Leu Thr Leu 

175 180 185 

aga aca caa teg cat gtt caa tec get cea 1164 

Arg Thr Gin Ser His Val Gin Ser Ala Pro 

190 195 



<210> 21 

<211> 388 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 21 

Met Lys Lys Pro Leu Gly Lys lie Val Ala Ser Thr Ala Leu Leu 
-190 -185 -180 



lie Ser Val Ala Phe Ser Ser Ser lie Ala Ser Ala Ala Thr Gly 
-175 -170 -165 



Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val Ser Met 
-160 -155 -150 



Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu Ala 
-145 -140 -135 



Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
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-130 



-125 



-120 



Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly Ser Val Phe 
-115 -110 -105 



Asp Thr Glu Ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
-100 -95 -90 



Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Val Leu Val Ser Tyr 
-85 -80 -75 



Gly lie Asp Gly Leu Asp Glu lie Val Gin Glu Leu Asn Ala Ala Asp 
-70 -65 -60 -55 



Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 

-50 -45 -40 



Val Val Leu Glu Val Leu Glu Gly Ser Gly Ala Asp Val Ser Gly Leu 

-35 -30 -25 



Leu Ala Asp Ala Gly Val Asp Ala Ser Ala Val Glu Val Thr Thr Ser 
-20 -15 -10 



Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala Tyr Tyr 
-5 -11 5 10 



Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr Asn Ala Ser Gly 

15 20 25 



Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr Val Gly Thr Pro 

30 35 40 



Val Ser lie Gly Asn Gly Lys Gly Val Phe Glu Arg Ser He Phe Pro 
45 50 55 



Gly Asn Asp Ser Ala Phe Val Arg Gly Thr Ser Asn Phe Thr Leu Thr 
60 65 70 



Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Ala Thr Val Ala Gly 
75 80 85 90 



His Asn Gin Ala Pro He Gly Ser Ala Val Cys Arg Ser Gly Ser Thr 

95 100 105 
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Thr Gly Trp His Cys Gly Thr lie Gin Ala Arg Asn Gin Thr Val Arg 

110 115 120 



Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr Thr Val Cys Ala 

125 130 135 



Glu Pro Gly Asp Ser Gly Gly Ser Tyr lie Ser Gly Thr Gin Ala Gin 

140 145 150 



Gly Val Thr Ser Gly Gly Ser Gly Asn Cys Ser Ala Gly Gly Thr Thr 
155 160 165 170 



Tyr Tyr Gin Glu Val Asn Pro Met Leu Ser Ser Trp Gly Leu Thr Leu 

175 180 185 



Arg Thr Gin Ser His Val Gin Ser Ala Pro 

190 195 
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